
(1) GENERAL INFORMATION: 



(i) APPLICANT: Scaglioni, Pier Paolo 
Me 1 egar i , Mar gher it a 
Wands , Jack 

(ii) TITLE OF INVENTION: INHIBITION OF HEPATITIS B REPLICATION 
(iii) NUMBER OF SEQUENCES: 14 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Fish & Richardson P.C. 

(B) STREET: 225 Franklin Street 

(C) CITY: Boston 

(D) STATE: MA 

(E) COUNTRY: USA 

(F) ZIP: 02110-2804 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/667,073 

(B) FILING DATE: 20-JUN-1996 

(vii) PRIOR APPLICATION DATA: 
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(B) FILING DATE: 20-JUN-1995 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Fraser, Janis K. (formerly Clark, Paul T.) 
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(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 617/542-5070 
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(C) TELEX: 200154 

(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1041 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

ATG GAC ATA GAT CCC TAT AAA GAA TTT GGT TCA TCT TAT CAG TTG TTG 48 
Met Asp lie Asp Pro Tyr Lys Glu Phe Gly Ser Ser Tyr Gin Leu Leu 
15 10 15 
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AAT TTT CTT CCT TTG GAC TTC TTT CCT GAC CTT AAT GCT TTG GTG GAC 96 
Asn Phe Leu Pro Leu Asp Phe Phe Pro Asp Leu Asn Ala Leu Val Asp 
20 25 30 

ACT GCT ACT GCC TTG TAT GAA GAA GAG CTA ACA GGT AGG GAA CAT TGC 144 
Thr Ala Thr Ala Leu Tyr Glu Glu Glu Leu Thr Gly Arg Glu His Cys 
35 40 45 

TCT CCG CAC CAT ACA GCT ATT AGA CAA GCT TTA GTA TGC TGG GAT GAA 192 
Ser Pro His His Thr Ala He Arg Gin Ala Leu Val Cys Trp Asp Glu 
50 55 60 

TTA ACT AAA TTG ATA GCT TGG ATG AGC TCT AAC ATA ACT TCT GAA CAA 240 
Leu Thr Lys Leu He Ala Trp Met Ser Ser Asn He Thr Ser Glu Gin 
65 70 75 80 

GTA AGA ACA ATC ATA GTA AAT CAT GTC AAT GAT ACC TGG GGA CTT AAG 288 
Val Arg Thr He He Val Asn His Val Asn Asp Thr Trp Gly Leu Lys 
85 90 95 

GTG AGA CAA AGT TTA TGG TTT CAT TTG TCA TGT CTC ACT TTC GGA CAA 336 
Val Arg Gin Ser Leu Trp Phe His Leu Ser Cys Leu Thr Phe Gly Gin 
100 105 110 

CAT ACA GTT CAA GAA TTT TTA GTA AGT TTT GTA GTA TGG ATC AGA ACT 384 
His Thr Val Gin Glu Phe Leu Val Ser Phe Val Val Trp He Arg Thr 
115 120 125 

CCA GCT CCA TAT AGA CCT CCT AAT GCA CCC ATT CTC TCG ACT CTT CCG 432 
Pro Ala Pro Tyr Arg Pro Pro Asn Ala Pro He Leu Ser Thr Leu Pro 
130 135 140 

GAA CAT ACA GTC ATT AGA AGA GGA GGT GCA AGA GCT TCT AGG TCC CCC 480 
Glu His Thr Val He Arg Arg Gly Gly Ala Arg Ala Ser Arg Ser Pro 
145 150 155 160 

AGA AGA CGC ACT CCC TCT CCT CGC AGG AGA AGA TCC CAA AAT TCG CAG 528 
Arg Arg Arg Thr Pro Ser Pro Arg Arg Arg Arg Ser Gin Asn Ser Gin 
165 170 175 

TTC CAA ACT TGC AAA CAC TTG CCA ACC TCC TGT CCA CCA ACT TGC AAT 576 
Phe Gin Thr Cys Lys His Leu Pro Thr Ser Cys Pro Pro Thr Cys Asn 
180 185 190 

GGC TTT CGT TGG ATG TAT CTG CGG CGT TTT ATC ATA TAC CTA TTA GTC 624 
Gly Phe Arg Trp Met Tyr Leu Arg Arg Phe He He Tyr Leu Leu Val 
195 200 205 

CTG CTG CTG TGC CTC ATC TTC TTG TTG GTT CTC CTG GAC TGG AAA GGT 672 
Leu Leu Leu Cys Leu He Phe Leu Leu Val Leu Leu Asp Trp Lys Gly 
210 215 220 

TTA ATA CCT GTC TGT CCT CTT CAA CCC ACA ACA GAA ACA ACA GTC AAT 720 
Leu He Pro Val Cys Pro Leu Gin Pro Thr Thr Glu Thr Thr Val Asn 
225 230 235 240 

TGC AGA CAA TGC ACA ATC TCT GCA CAA GAC ATG TAT ACT CCT CCT TAC 768 
Cys Arg Gin Cys Thr He Ser Ala Gin Asp Met Tyr Thr Pro Pro Tyr 
245 250 255 
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TGT TGT TGT TTA AAA CCT ACG GCA GGA AAT TGC ACT TGT TGG CCC ATC 816 
Cys Cys Cys Leu Lys Pro Thr Ala Gly Asn Cys Thr Cys Trp Pro lie 
260 265 270 

CCT TCA TCA TGG GCT TTA GGA AAT TAC CTA TGG GAG TGG GCC TTA GCT 864 
Pro Ser Ser Trp Ala Leu Gly Asn Tyr Leu Trp Glu Trp Ala Leu Ala 
275 280 285 

CGT CTC TCT TGG CTC AAT TTA CTA GTG CCC TTG CTT CAA TGG TTA GGA 912 
Arg Leu Ser Trp Leu Asn Leu Leu Val Pro Leu Leu Gin Trp Leu Gly 
290 295 300 

GGA ATT TCC CTC ATT GCG TGG TTT TTG CTT ATA TGG ATG ATT TGG TTT 960 
Gly lie Ser Leu lie Ala Trp Phe Leu Leu lie Trp Met lie Trp Phe 
305 310 315 320 

TGG GGG CCC GCA CTT CTG AGC ATC TTA CCG CCA TTT ATT CCC ATA TTT 1008 
Trp Gly Pro Ala Leu Leu Ser lie Leu Pro Pro Phe lie Pro lie Phe 
325 330 335 

GTT CTG TTT TTC TTG ATT TGG GTA TAC ATT TGA 1041 
Val Leu Phe Phe Leu lie Trp Val Tyr lie 
340 345 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 346 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
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50 










55 










60 










Leu 


Thr 


Lys 


Leu 


He 


Ala 


Trp 


Met 


Ser 


Ser 


Asn 


He 


Thr 


Ser 


Glu 


Gin 


65 








70 








75 










80 


Val 


Arg 


Thr 


He 


He 


Val 


Asn 


His 


Val 


Asn 


Asp 


Thr 


Trp Gly Leu 


Lys 








85 










90 










95 




Val 


Arg 


Gin 


Ser 


Leu 


Trp 
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Leu 
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(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1056 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

CCT CGC AGG ATG GAC ATC GAC CCT TAT AAA GAA TTT GGA GCT ACT GTG 48 
Pro Arg Arg Met Asp lie Asp Pro Tyr Lys Glu Phe Gly Ala Thr Val 
15 10 15 

GAG TTA CTC TCG TTT TTG CCT TCT GAC TTC TTT CCT TCA GTA CGA GAT 96 
Glu Leu Leu Ser Phe Leu Pro Ser Asp Phe Phe Pro Ser Val Arg Asp 
20 25 30 

CTT CTA GAT ACC GCC TCA GCT CTG TAT CGG GAA GCC TTA GAG TCT CCT 144 
Leu Leu Asp Thr Ala Ser Ala Leu Tyr Arg Glu Ala Leu Glu Ser Pro 
35 40 45 

GAG CAT TGT TCA CCT CAC CAT ACT GCA CTC AGG CAA GCA ATT CTT TGC 192 
Glu His Cys Ser Pro His His Thr Ala Leu Arg Gin Ala He Leu Cys 
50 55 60 

TGG GGG GAA CTA ATG ACT CTA GCT ACC TGG GTG GGT GTT AAT TTG GAA 240 
Trp Gly Glu Leu Met Thr Leu Ala Thr Trp Val Gly Val Asn Leu Glu 
65 70 75 80 

GAT CCA GCG TCT AGA GAC CTA GTA GTC AGT TAT GTC AAC ACT AAT ATG 288 
Asp Pro Ala Ser Arg Asp Leu Val Val Ser Tyr Val Asn Thr Asn Met 
85 90 95 

GGC CTA AAG TTC AGG CAA CTC TTG TGG TTT CAC ATT TCT TGT CTC ACT 336 
Gly Leu Lys Phe Arg Gin Leu Leu Trp Phe His He Ser Cys Leu Thr 
100 105 110 

TTT GGA AGA GAA ACA GTT ATA GAG TAT TTG GTG TCT TTC GGA GTG TGG 384 
Phe Gly Arg Glu Thr Val lie Glu Tyr Leu Val Ser Phe Gly Val Trp 
115 120 125 
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ATT CGC ACT CCT CCA GCT TAT AGA CCA CCA AAT GCC CCT ATC CTA TCA 432 
lie Arg Thr Pro Pro Ala Tyr Arg Pro Pro Asn Ala Pro lie Leu Ser 
130 135 140 

ACA CTT CCG GAA CAT ACA GTC ATT AGA AGA GGA GGT GCA AGA GCT TCT 480 
Thr Leu Pro Glu His Thr Val lie Arg Arg Gly Gly Ala Arg Ala Ser 
145 150 155 160 

AGG TCC CCC AGA AGA CGC ACT CCC TCT CCT CGC AGG AGA AGA TCC CAA 528 
Arg Ser Pro Arg Arg Arg Thr Pro Ser Pro Arg Arg Arg Arg Ser Gin 
165 170 175 

AAT TCG CAG TCC CCA ACC TCC AAT CAC TCA CCA ACC TCT TGT CCT CCA 576 
Asn Ser Gin Ser Pro Thr Ser Asn His Ser Pro Thr Ser Cys Pro Pro 
180 185 190 

ACT TGT CCT GGT TAT CGC TGG ATG TGT CTG CGG CGT TTT ATC ATC TTC 624 
Thr Cys Pro Gly Tyr Arg Trp Met Cys Leu Arg Arg Phe lie lie Phe 
195 200 205 

CTC TTC ATC CTG CTG CTA TGC CTC ATC TTC TTG TTG GTT CTT CTG GAC 672 
Leu Phe lie Leu Leu Leu Cys Leu lie Phe Leu Leu Val Leu Leu Asp 
210 215 220 

TAT CAA GGT ATG TTG CCC GTT TGT CCT CTA ATT CCA GGA TCC TCA ACA 720 
Tyr Gin Gly Met Leu Pro Val Cys Pro Leu lie Pro Gly Ser Ser Thr 
225 230 235 240 

ACC AGC ACG GGA CCA TGC CGG ACC TGC ATG ACT ACT GCT CAA GGA ACC 768 
Thr Ser Thr Gly Pro Cys Arg Thr Cys Met Thr Thr Ala Gin Gly Thr 
245 250 255 

TCT ATG TAT CCC TCC TGT TGC TGT ACC AAA CCT TCG GAC GGA AAT TGC 816 
Ser Met Tyr Pro Ser Cys Cys Cys Thr Lys Pro Ser Asp Gly Asn Cys 
260 265 270 

ACC TGT ATT CCC ATC CCA TCA TCC TGG GCT TTC GGA AAA TTC CTA TGG 864 
Thr Cys lie Pro lie Pro Ser Ser Trp Ala Phe Gly Lys Phe Leu Trp 
275 280 285 

GAG TGG GCC TCA GCC CGT TTC TCC TGG CTC AGT TTA CTA GTG CCA TTT 912 
Glu Trp Ala Ser Ala Arg Phe Ser Trp Leu Ser Leu Leu Val Pro Phe 
290 295 300 

GTT CAG TGG TTC GTA GGG CTT TCC CCC ACT GTT TGG CTT TCA GTT ATA 960 
Val Gin Trp Phe Val Gly Leu Ser Pro Thr Val Trp Leu Ser Val lie 
305 310 315 320 

TGG ATG ATG TGG TAT TGG GGG CCA AGT CTG TAC AGC ATC TTG AGT CCC 1008 
Trp Met Met Trp Tyr Trp Gly Pro Ser Leu Tyr Ser lie Leu Ser Pro 
325 330 335 

TTT TTA CCG CTG TTA CCA ATT TTC TTT TGT CTT TGG GTA TAC ATT TAA 1056 
Phe Leu Pro Leu Leu Pro lie Phe Phe Cys Leu Trp Val Tyr lie 
340 345 350 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 351 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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(ii) MOLECULE 


TYPE: protein 












(xi) SEQUENCE 


DESCRIPTION: SEQ ID NO: 4: 
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(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1194 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

ATG GAC ATC GAC CCT TAT AAA GAA TTT GGA GCT ACT GTG GAG TTA CTC 48 
Met Asp lie Asp Pro Tyr Lys Glu Phe Gly Ala Thr Val Glu Leu Leu 
15 10 15 

TCG TTT TTG CCT TCT GAC TTC TTT CCT TCA GTA CGA GAT CTT CTA GAT 96 
Ser Phe Leu Pro Ser Asp Phe Phe Pro Ser Val Arg Asp Leu Leu Asp 
20 25 30 

ACC GCC TCA GCT CTG TAT CGG GAA GCC TTA GAG TCT CCT GAG CAT TGT 144 
Thr Ala Ser Ala Leu Tyr Arg Glu Ala Leu Glu Ser Pro Glu His Cys 
35 40 45 

TCA CCT CAC CAT ACT GCA CTC AGG CAA GCA ATT CTT TGC TGG GGG GAA 192 
Ser Pro His His Thr Ala Leu Arg Gin Ala lie Leu Cys Trp Gly Glu 
50 55 60 

CTA ATG ACT CTA GCT ACC TGG GTG GGT GTT AAT TTG GAA GAT CCA GCG 240 
Leu Met Thr Leu Ala Thr Trp Val Gly Val Asn Leu Glu Asp Pro Ala 
65 70 75 80 

TCT AGA GAC CTA GTA GTC AGT TAT GTC AAC ACT AAT ATG GGC CTA AAG 288 
Ser Arg Asp Leu Val Val Ser Tyr Val Asn Thr Asn Met Gly Leu Lys 
85 90 95 

TTC AGG CAA CTC TTG TGG TTT CAC ATT TCT TGT CTC ACT TTT GGA AGA 336 
Phe Arg Gin Leu Leu Trp Phe His lie Ser Cys Leu Thr Phe Gly Arg 
100 105 110 

GAA ACA GTT ATA GAG TAT TTG GTG TCT TTC GGA GTG TGG ATT CGC ACT 384 
Glu Thr Val lie Glu Tyr Leu Val Ser Phe Gly Val Trp lie Arg Thr 
115 120 125 

CCT CCA GCT TAT AGA CCA CCA AAT GCC CCT ATC CTA TCA ACA CTT CCG 432 
Pro Pro Ala Tyr Arg Pro Pro Asn Ala Pro lie Leu Ser Thr Leu Pro 
130 135 140 

GAG ACT ACT GTT GTT AGA CGA CGA GGC AGG TCC CCT AGA AGA AGA ACT 480 
Glu Thr Thr Val Val Arg Arg Arg Gly Arg Ser Pro Arg Arg Arg Thr 
145 150 155 160 

CCC TCG CCT CGC AGA CGA AGG TCT CAA TCG CCG CGT CGC AGA AGA TCT 528 
Pro Ser Pro Arg Arg Arg Arg Ser Gin Ser Pro Arg Arg Arg Arg Ser 
165 170 175 

CAA TCT CGG CTA GGA CCC CTT CTC GTG TTA CAG GCG GGG TTT TTC TTG 576 
Gin Ser Arg Leu Gly Pro Leu Leu Val Leu Gin Ala Gly Phe Phe Leu 
180 185 190 

TTG ACA AGA ATC CTC ACA ATA CCG CAG AGT CTA GAC TCG TGG TGG ACT 624 
Leu Thr Arg lie Leu Thr lie Pro Gin Ser Leu Asp Ser Trp Trp Thr 
195 200 205 

TCT CTC AAT TTT CTA GGG GGA ACT ACC GTG TGT CTT GGC CAA AAT TCG 672 
Ser Leu Asn Phe Leu Gly Gly Thr Thr Val Cys Leu Gly Gin Asn Ser 
210 215 220 

CAG TCC CCA ACC TCC AAT CAC TCA CCA ACC TCT TGT CCT CCA ACT TGT 720 
Gin Ser Pro Thr Ser Asn His Ser Pro Thr Ser Cys Pro Pro Thr Cys 
225 230 235 240 
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CCT GGT TAT CGC TGG ATG TGT CTG CGG CGT TTT ATC ATC TTC CTC TTC 768 
Pro Gly Tyr Arg Trp Met Cys Leu Arg Arg Phe lie lie Phe Leu Phe 
245 250 255 

ATC CTG CTG CTA TGC CTC ATC TTC TTG TTG GTT CTT CTG GAC TAT CAA 816 
lie Leu Leu Leu Cys Leu lie Phe Leu Leu Val Leu Leu Asp Tyr Gin 
260 265 270 

GGT ATG TTG CCC GTT TGT CCT CTA ATT CCA GGA TCC TCA ACA ACC AGC 864 
Gly Met Leu Pro Val Cys Pro Leu lie Pro Gly Ser Ser Thr Thr Ser 
275 280 285 

ACG GGA CCA TGC CGG ACC TGC ATG ACT ACT GCT CAA GGA ACC TCT ATG 912 
Thr Gly Pro Cys Arg Thr Cys Met Thr Thr Ala Gin Gly Thr Ser Met 
290 295 300 

TAT CCC TCC TGT TGC TGT ACC AAA CCT TCG GAC GGA AAT TGC ACC TGT 960 
Tyr Pro Ser Cys Cys Cys Thr Lys Pro Ser Asp Gly Asn Cys Thr Cys 
305 310 315 320 

ATT CCC ATC CCA TCA TCC TGG GCT TTC GGA AAA TTC CTA TGG GAG TGG 1008 
lie Pro lie Pro Ser Ser Trp Ala Phe Gly Lys Phe Leu Trp Glu Trp 
325 330 335 

GCC TCA GCC CGT TTC TCC TGG CTC AGT TTA CTA GTG CCA TTT GTT CAG 1056 
Ala Ser Ala Arg Phe Ser Trp Leu Ser Leu Leu Val Pro Phe Val Gin 
340 345 350 

TGG TTC GTA GGG CTT TCC CCC ACT GTT TGG CTT TCA GTT ATA TGG ATG 1104 
Trp Phe Val Gly Leu Ser Pro Thr Val Trp Leu Ser Val lie Trp Met 
355 360 365 

ATG TGG TAT TGG GGG CCA AGT CTG TAC AGC ATC TTG AGT CCC TTT TTA 1152 
Met Trp Tyr Trp Gly Pro Ser Leu Tyr Ser lie Leu Ser Pro Phe Leu 
370 375 380 

CCG CTG TTA CCA ATT TTC TTT TGT CTT TGG GTA TAC ATT TAA 1194 
Pro Leu Leu Pro lie Phe Phe Cys Leu Trp Val Tyr lie 
385 390 395 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 397 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 



Met 


Asp 


lie Asp 


Pro 


Tyr 


Lys 


Glu 


Phe 


Gly 


Ala 


Thr 


Val 


Glu 


Leu 


Leu 


1 






5 










10 










15 




Ser 


Phe 


Leu 


Pro 


Ser 


Asp 


Phe 


Phe 


Pro 


Ser 


Val 


Arg 


Asp 


Leu 


Leu 


Asp 








20 








25 










30 






Thr 


Ala 


Ser 


Ala 


Leu 


Tyr 


Arg 


Glu 


Ala 


Leu 


Glu 


Ser 


Pro 


Glu 


His 


Cys 






35 






40 










45 








Ser 


Pro 


His 


His 


Thr 


Ala 


Leu 


Arg 


Gin 


Ala 


He 


Leu 


Cys 


Trp 


Gly 


Glu 




50 










55 










60 










Leu 


Met 


Thr 


Leu 


Ala 


Thr Trp Val 


Gly 


Val 


Asn 


Leu 


Glu 


Asp 


Pro 


Ala 


65 










70 










75 










80 
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Ser 


Arg 


Asp 


Leu 


Val 


Val 


Ser 


Tyr 


Val 


Asn 


Thr Asn Met 


Gly 


Leu 


Lys 






85 










90 






95 




Phe 


Arg 


Gin 


Leu 


Leu 


Trp 


Phe 


His 


He 


Ser 


Cys Leu Thr 


Phe 


Gly 


Arg 






100 








105 






110 






Glu 


Thr 


Val 


He 


Glu 


Tyr 


Leu 


Val 


Ser 


Phe 


Gly Val Trp 


He 


Arg 


Thr 






115 








120 






125 








Pro 


Pro 


Ala 


Tyr 


Arg 


Pro 


Pro 


Asn 


Ala 


Pro 


He Leu Ser 


Thr 


Leu 


Pro 




130 






135 








140 








Glu 


Thr 


Thr 


Val 


Val 


Arg 


Arg 


Arg 


Gly Arg 


Ser Pro Arg 


Arg 


Arg 


Thr 


145 










150 










155 






160 


Pro 


Ser 


Pro 


Arg 


Arg 
165 


Arg 


Arg 


Ser 


Gin 


Ser 
170 


Pro Arg Arg 


Arg 


Arg 
175 


Ser 


Gin 


Ser 


Arg 


Leu 


Gly 


Pro 


Leu 


Leu 


Val 


Leu 


Gin Ala Gly 


Phe 


Phe 


Leu 






180 








185 






190 






Leu 


Thr 


Arg 
195 


He 


Leu 


Thr 


He 


Pro 
200 


Gin 


Ser 


Leu Asp Ser 
205 


Trp 


Trp 


Thr 


Ser 


Leu 
210 


Asn 


Phe 


Leu 


Gly 


Gly 
215 


Thr 


Thr 


Val 


Cys Leu Gly 
220 


Gin 


Asn 


Ser 


Gin 


Ser 


Pro 


Thr 


Ser 


Asn 


His 


Ser 


Pro 


Thr 


Ser Cys Pro 


Pro 


Thr 


Cys 


225 










230 










235 






240 


Pro 


Gly 


Tyr 


Arg 


Trp 


Met 


Cys 


Leu 


Arg Arg 


Phe He He 


Phe 


Leu 


Phe 








245 










250 






255 




He 


Leu 


Leu 


Leu 


Cys 


Leu 


He 


Phe 


Leu 


Leu 


Val Leu Leu 


Asp 


Tyr 


Gin 








260 








265 






270 






Gly 


Met 


Leu 


Pro 


Val 


Cys 


Pro 


Leu 


He 


Pro 


Gly Ser Ser 


Thr 


Thr 


Ser 




275 








280 






285 








Thr 


Gly 
290 


Pro 


Cys 


Arg 


Thr 


Cys 
295 


Met 


Thr 


Thr 


Ala Gin Gly 
300 


Thr 


Ser 


Met 


Tyr 


Pro 


Ser 


Cys 


Cys 


Cys 


Thr 


Lys 


Pro 


Ser 


Asp Gly Asn 


Cys 


Thr 


Cys 


305 










310 










315 






320 


He 


Pro 


He 


Pro 


Ser 


Ser 


Trp 


Ala 


Phe Gly 


Lys Phe Leu 


Trp 


Glu 


Trp 










325 








330 






335 




Ala 


Ser 


Ala 


Arg 


Phe 


Ser 


Trp 


Leu 


Ser 


Leu 


Leu Val Pro 


Phe 


Val 


Gin 








340 








345 






350 






Trp 


Phe 


Val 


Gly 


Leu 


Ser 


Pro 


Thr 


Val 


Trp 


Leu Ser Val 


He 


Trp 


Met 




355 








360 






365 








Met 


Trp 
370 


Tyr 


Trp 


Gly 


Pro 


Ser 
375 


Leu 


Tyr 


Ser 


He Leu Ser 
380 


Pro 


Phe 


Leu 


Pro 


Leu 


Leu 


Pro 


He 


Phe 


Phe 


Cys 


Leu 


Trp 


Val Tyr He 









385 390 395 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 870 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

ATG GAC ATC GAC CCT TAT AAA GAA TTT GGA GCT ACT GTG GAG TTA CTC 48 
Met Asp He Asp Pro Tyr Lys Glu Phe Gly Ala Thr Val Glu Leu Leu 
15 10 15 

TCG TTT TTG CCT TCT GAC TTC TTT CCT TCA GTA CGA GAT CTT CTA GAT 96 
Ser Phe Leu Pro Ser Asp Phe Phe Pro Ser Val Arg Asp Leu Leu Asp 
20 25 30 
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ACC GCC TCA GCT CTG TAT CGG GAA GCC TTA GAG TCT CCT GAG CAT TGT 144 
Thr Ala Ser Ala Leu Tyr Arg Glu Ala Leu Glu Ser Pro Glu His Cys 
35 40 45 

TCA CCT CAC CAT ACT GCA CTC AGG CAA GCA ATT CTT TGC TGG GGG GAA 192 
Ser Pro His His Thr Ala Leu Arg Gin Ala lie Leu Cys Trp Gly Glu 
50 55 60 

CTA ATG ACT CTA GCT ACC TGG GTG GGT GTT AAT TTG GAA GAT CCA GCG 240 
Leu Met Thr Leu Ala Thr Trp Val Gly Val Asn Leu Glu Asp Pro Ala 
65 70 75 80 

TCT AGA GAC CTA GTA GTC AGT TAT GTC AAC ACT AAT ATG GGC CTA AAG 288 
Ser Arg Asp Leu Val Val Ser Tyr Val Asn Thr Asn Met Gly Leu Lys 
85 90 95 

TTC AGG CAA CTC TTG TGG TTT CAC ATT TCT TGT CTC ACT TTT GGA AGA 336 
Phe Arg Gin Leu Leu Trp Phe His lie Ser Cys Leu Thr Phe Gly Arg 
100 105 110 

GAA ACA GTT ATA GAG TAT TTG GTG TCT TTC GGA GTG TGG ATT CGC ACT 384 
Glu Thr Val lie Glu Tyr Leu Val Ser Phe Gly Val Trp lie Arg Thr 
115 120 125 

CCT CCA GCT TAT AGA CCA CCA AAT GCC CCT ATC CTA TCA ACA CTT CCG 432 
Pro Pro Ala Tyr Arg Pro Pro Asn Ala Pro lie Leu Ser Thr Leu Pro 
130 135 140 

GAG ACT ACT GTT GTT AGA CGA CGA GGC AGG TCC CCT AGA AGA AGA ACT 480 
Glu Thr Thr Val Val Arg Arg Arg Gly Arg Ser Pro Arg Arg Arg Thr 
145 150 155 160 

CCC TCG CCT CGC AGA CGA AGG TCT CAA TCG CCG CGT CGC AGA AGA TCC 528 
Pro Ser Pro Arg Arg Arg Arg Ser Gin Ser Pro Arg Arg Arg Arg Ser 
165 170 175 

TCA ACA ACC AGC ACG GGA CCA TGC CGG ACC TGC ATG ACT ACT GCT CAA 576 
Ser Thr Thr Ser Thr Gly Pro Cys Arg Thr Cys Met Thr Thr Ala Gin 
180 185 190 

GGA ACC TCT ATG TAT CCC TCC TGT TGC TGT ACC AAA CCT TCG GAC GGA 624 
Gly Thr Ser Met Tyr Pro Ser Cys Cys Cys Thr Lys Pro Ser Asp Gly 
195 200 205 

AAT TGC ACC TGT ATT CCC ATC CCA TCA TCC TGG GCT TTC GGA AAA TTC 672 
Asn Cys Thr Cys lie Pro lie Pro Ser Ser Trp Ala Phe Gly Lys Phe 
210 215 220 

CTA TGG GAG TGG GCC TCA GCC CGT TTC TCC TGG CTC AGT TTA CTA GTG 720 
Leu Trp Glu Trp Ala Ser Ala Arg Phe Ser Trp Leu Ser Leu Leu Val 
225 230 235 240 

CCA TTT GTT CAG TGG TTC GTA GGG CTT TCC CCC ACT GTT TGG CTT TCA 768 
Pro Phe Val Gin Trp Phe Val Gly Leu Ser Pro Thr Val Trp Leu Ser 
245 250 255 

GTT ATA TGG ATG ATG TGG TAT TGG GGG CCA AGT CTG TAC AGC ATC TTG 816 
Val lie Trp Met Met Trp Tyr Trp Gly Pro Ser Leu Tyr Ser lie Leu 
260 265 270 
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AGT CCC TTT TTA CCG CTG TTA CCA ATT TTC TTT TGT CTT TGG GTA TAC 864 
Ser Pro Phe Leu Pro Leu Leu Pro lie Phe Phe Cys Leu Trp Val Tyr 
275 280 285 

ATT TAA 870 
He 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 289 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 



Met 


Asp 


He 


Asp 


Pro 


Tyr 


Lys 


Glu 


Phe 


Gly 


Ala 


Thr 


Val 


Glu 


Leu 


Leu 


1 




5 










10 










15 




Ser 


Phe 


Leu 


Pro 


Ser 


Asp 


Phe 


Phe 


Pro 


Ser 


Val 


Arg 


Asp 


Leu 


Leu 


Asp 








20 








25 










30 






Thr 


Ala 


Ser 


Ala 


Leu 


Tyr 


Arg 


Glu 


Ala 


Leu 


Glu 


Ser 


Pro 


Glu 


His 


Cys 






35 






40 










45 








Ser 


Pro 
50 


His 


His 


Thr 


Ala 


Leu 
55 


Arg 


Gin 


Ala 


He 


Leu 
60 


Cys 


Trp 


Gly 


Glu 


Leu 


Met 


Thr 


Leu 


Ala 


Thr 


Trp 


Val 


Gly 


Val 


Asn 


Leu 


Glu 


Asp 


Pro 


Ala 


65 










70 








75 










80 


Ser 


Arg 


Asp 


Leu 


Val 


Val 


Ser 


Tyr 


Val 


Asn 


Thr 


Asn 


Met 


Gly 


Leu 


Lys 






85 










90 










95 




Phe 


Arg 


Gin 


Leu 


Leu 


Trp 


Phe 


His 


He 


Ser 


Cys 


Leu 


Thr 


Phe 


Gly 


Arg 






100 








105 










110 






Glu 


Thr 


Val 


He 


Glu 


Tyr 


Leu 


Val 


Ser 


Phe 


Gly Val 


Trp 


He 


Arg 


Thr 






115 








120 










125 








Pro 


Pro 


Ala 


Tyr 


Arg 


Pro 


Pro 


Asn 


Ala 


Pro 


He 


Leu 


Ser 


Thr 


Leu 


Pro 




130 






135 










140 










Glu 


Thr 


Thr 


Val 


Val 


Arg 


Arg 


Arg 


Gly 


Arg 


Ser 


Pro 


Arg 


Arg 


Arg 


Thr 


145 










150 










155 










160 


Pro 


Ser 


Pro 


Arg 


Arg 


Arg 


Arg 


Ser 


Gin 


Ser 


Pro 


Arg 


Arg 


Arg 


Arg 


Ser 








165 










170 










175 




Ser 


Thr 


Thi: 


Ser 


Thr 


Gly 


Pro 


Cys 


Arg 


Thr 


Cys 


Met 


Thr 


Thr 


Ala 


Gin 








180 






185 










190 






Gly 


Thr 


Ser 


Met 


Tyr 


Pro 


Ser 


Cys 


Cys 


Cys 


Thr 


Lys 


Pro 


Ser 


Asp 


Gly 




195 








200 










205 








Asn 


Cys 


Thr 


Cys 


He 


Pro 


He 


Pro 


Ser 


Ser 


Trp Ala 


Phe 


Gly 


Lys 


Phe 




210 








215 










220 










Leu 


Trp 


Glu 


Trp 


Ala 


Ser 


Ala 


Arg 


Phe 


Ser 


Trp 


Leu 


Ser 


Leu 


Leu 


Val 


225 






230 










235 










240 


Pro 


Phe 


Val 


Gin 


Trp 
245 


Phe 


Val 


Gly 


Leu 


Ser 
250 


Pro 


Thr 


Val 


Trp 


Leu 
255 


Ser 


Val 


He 


Trp 


Met 


Met 


Trp 


Tyr 


Trp 


Gly 


Pro 


Ser 


Leu 


Tyr 


Ser 


He 


Leu 






260 










265 










270 






Ser 


Pro 


Phe 
275 


Leu 


Pro 


Leu 


Leu 


Pro 
280 


He 


Phe 


Phe 


Cys 


Leu 
285 


Trp 


Val 


Tyr 



He 
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(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1263 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

ATG GAT ATC AAT GCT TCT AG A GCC TTA GCC AAT GTG TAT GAT CTA CCA 48 
Met Asp lie Asn Ala Ser Arg Ala Leu Ala Asn Val Tyr Asp Leu Pro 
15 10 15 

GAT GAT TTC TTT CCA AAA ATA GAT GAT CTT GTT AGA GAT GCT AAA GAC 96 
Asp Asp Phe Phe Pro Lys lie Asp Asp Leu Val Arg Asp Ala Lys Asp 
20 25 30 

GCT TTA GAG CCT TAT TGG AAA TCA GAT TCA ATA AAG AAA CAT GTT TTG 144 
Ala Leu Glu Pro Tyr Trp Lys Ser Asp Ser lie Lys Lys His Val Leu 
35 40 45 

ATT GCA ACT CAC TTT GTG GAT CTT ATT GAA GAC TTC TGG CAG ACT ACA 192 
lie Ala Thr His Phe Val Asp Leu lie Glu Asp Phe Trp Gin Thr Thr 
50 55 60 

CAG GGC ATG CAT GAA ATA GCC GAA TCA TTA AGA GCT GTT ATA CCT CCC 240 
Gin Gly Met His Glu lie Ala Glu Ser Leu Arg Ala Val lie Pro Pro 
65 70 75 80 

ACT ACT ACT CCT GTT CCA CCG GGT TAT CTT ATT CAG CAC GAA GAA GCT 288 
Thr Thr Thr Pro Val Pro Pro Gly Tyr Leu lie Gin His Glu Glu Ala 
85 90 95 

GAA GAG ATA CCT TTG GGA GAT TTA TTT AAA CAC CAA GAA GAA AGG ATA 336 
Glu Glu lie Pro Leu Gly Asp Leu Phe Lys His Gin Glu Glu Arg lie 
100 105 110 

GTG AGT TTC CAA CCC GAC TAT CCG ATT ACG GCT AGA ATT CAT GCT CAT 384 
Val Ser Phe Gin Pro Asp Tyr Pro lie Thr Ala Arg lie His Ala His 
115 120 125 

TTG AAA GCT TAT GCA AAA ATT AAC GAG GAA TCA CTG GAT AGG GCT AGG 432 
Leu Lys Ala Tyr Ala Lys lie Asn Glu Glu Ser Leu Asp Arg Ala Arg 
130 135 140 

AGA TTG CTT TGG TGG CAT TAC AAC TGT TTA CTG TGG GGA GAA GCT CAA 480 
Arg Leu Leu Trp Trp His Tyr Asn Cys Leu Leu Trp Gly Glu Ala Gin 
145 150 155 160 

GTT ACT AAC TAT ATT TCT CGC TTG CGT ACT TGG TTG TCA ACT CCT GAG 528 
Val Thr Asn Tyr lie Ser Arg Leu Arg Thr Trp Leu Ser Thr Pro Glu 
165 170 175 

AAA TAT AGA GGT AGA GAT GCC CCG ACC ATT GAA GCA ATC ACT AGA CCA 576 
Lys Tyr Arg Gly Arg Asp Ala Pro Thr lie Glu Ala lie Thr Arg Pro 
180 185 190 
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ATC CAG GTG GCT CAG GGA GGC CGA AAA ACA ACT ACG GGT ACT AGA AAA 624 
He Gin Val Ala Gin Gly Gly Arg Lys Thr Thr Thr Gly Thr Arg Lys 
195 200 205 

CCT CGT GGA CTC GAA CCT AGA AGA AGA AAA GTT AAA ACC ACA GTT GTC 672 
Pro Arg Gly Leu Glu Pro Arg Arg Arg Lys Val Lys Thr Thr Val Val 
210 215 220 

TAT GGG AGA AGA CGT TCA AAG TCC CGG GGA AGG AGA GCC CCT ACA CCC 720 
Tyr Gly Arg Arg Arg Ser Lys Ser Arg Gly Arg Arg Ala Pro Thr Pro 
225 230 235 240 

CAA CGT GCG GGC TCC CCT CTC CCA CGT AGT TCG AGC AGC CAC CAT AGA 768 
Gin Arg Ala Gly Ser Pro Leu Pro Arg Ser Ser Ser Ser His His Arg 
245 250 255 

TCC TTC GGG GGA ATA CTA GCT GGC CTA ATC GGA TTA CTG GTA AGC TTT 816 
Ser Phe Gly Gly He Leu Ala Gly Leu He Gly Leu Leu Val Ser Phe 
260 265 270 

TTC TTG TTG ATA AAA ATT CTA GAA ATA CTG AGG AGG CTA GAT TGG TGG 864 
Phe Leu Leu He Lys He Leu Glu He Leu Arg Arg Leu Asp Trp Trp 
275 280 285 

TGG ATT TCT CTC AGT TCT CCA AAG GGA AAA ATG CAA TGC GCT TTC CAA 912 
Trp He Ser Leu Ser Ser Pro Lys Gly Lys Met Gin Cys Ala Phe Gin 
290 295 300 

GAT ACT GGA GCC CAA ATC TCT CCA CAT TAC GTC GGA TCT TGC CCG TGG 960 
Asp Thr Gly Ala Gin He Ser Pro His Tyr Val Gly Ser Cys Pro Trp 
305 310 315 320 

GGA TGC CCA GGA TTT CTT TGG ACC TAT CTC AGG CTT TTT ATC ATC TTC 1008 
Gly Cys Pro Gly Phe Leu Trp Thr Tyr Leu Arg Leu Phe He He Phe 
325 330 335 

CTC TTA ATC CTG CTA GTA GCA GCA GGC TTG CTG TAT CTG ACG GAC AAC 1056 
Leu Leu He Leu Leu Val Ala Ala Gly Leu Leu Tyr Leu Thr Asp Asn 
340 345 350 

GGG TCT ACT ATT TTA GGA AAG CTC CAA TGG GCG TCG GTC TCA GCC CTT 1104 
Gly Ser Thr He Leu Gly Lys Leu Gin Trp Ala Ser Val Ser Ala Leu 
355 360 365 

TTC TCC TCC ATC TCT TCA CTA CTG CCC TCG GAT CCG AAA TCT CTC GTC 1152 
Phe Ser Ser He Ser Ser Leu Leu Pro Ser Asp Pro Lys Ser Leu Val 
370 375 380 

GCT TTA ACG TTT GGA CTT TCA CTT ATA TGG ATG ACT TCC TCC TCT GCC 1200 
Ala Leu Thr Phe Gly Leu Ser Leu He Trp Met Thr Ser Ser Ser Ala 
385 390 395 400 

ACC CAA ACG CTC GTC ACC TTA ACG CAA TTA GCC ACG CTG TCT GCT CTT 1248 
Thr Gin Thr Leu Val Thr Leu Thr Gin Leu Ala Thr Leu Ser Ala Leu 
405 410 415 

TTT TAC AAG AGC TAG 1263 
Phe Tyr Lys Ser 
420 
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(2) INFORMATION FOR SEQ ID NO: 10: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 420 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 



Met 


Asp 


He 


Asn 


Ala 


Ser 


Arg 


Ala 


Leu 


Ala 


Asn 


Val 


Tyr 


Asp 


Leu 


Pro 


1 






5 










10 










15 




Asp 


Asp 


Phe 


Phe 


Pro 


Lys 


He 


Asp 


Asp 


Leu 


Val 


Arg 


Asp 


Ala 


Lys 


Asp 




20 










25 










30 






Ala 


Leu 


Glu 
35 


Pro 


Tyr 


Trp 


Lys 


Ser 
40 


Asp 


Ser 


lie 


Lys 


Lys 
45 


His 


Val 


Leu 


He 


Ala 
50 


Thr 


His 


Phe 


Val 


Asp 
55 


Leu 


lie 


Glu 


Asp 


Phe 
60 


Trp 


Gin 


Thr 


Thr 


Gin 


Gly 


Met 


His 


Glu 


He 


Ala 


Glu 


Ser 


Leu 


Arg 


Ala 


Val 


He 


Pro 


Pro 


65 








70 










75 










80 


Thr 


Thr 


Thr 


Pro 


Val 


Pro 


Pro 


Gly 


Tyr 


Leu 


lie 


Gin 


His 


Glu 


Glu 


Ala 










85 






90 










95 




Glu 


Glu 


He 


Pro 


Leu 


Gly 


Asp 


Leu 


Phe 


Lys 


His 


Gin 


Glu 


Glu 


Arg 


lie 








100 






105 










110 






Val 


Ser 


Phe 


Gin 


Pro 


Asp 


Tyr 


Pro 


He 


Thr 


Ala 


Arg 


He 


His 


Ala 


His 






115 






120 










125 








Leu 


Lys 


Ala 


Tyr 


Ala 


Lys 


He 


Asn 


Glu 


Glu 


Ser 


Leu 


Asp Arg 


Ala 


Arg 




130 








135 










140 










Arg 


Leu 


Leu 


Trp 


Trp 


His 


Tyr 


Asn 


Cys 


Leu 


Leu 


Trp 


Gly Glu 


Ala 


Gin 


145 








150 










155 










160 


Val 


Thr 


Asn 


Tyr 


He 


Ser 


Arg 


Leu 


Arg 


Thr 


Trp 


Leu 


Ser 


Thr 


Pro 


Glu 








165 










170 










175 




Lys 


Tyr 


Arg 


Gly 


Arg 


Asp 


Ala 


Pro 


Thr 


lie 


Glu 


Ala 


lie 


Thr 


Arg 


Pro 




180 










185 










190 






He 


Gin 


Val 


Ala 


Gin 


Gly 


Gly 


Arg 


Lys 


Thr 


Thr 


Thr 


Gly Thr 


Arg 


Lys 






195 










200 










205 








Pro 


Arg 


Gly 


Leu 


Glu 


Pro 


Arg 


Arg 


Arg 


Lys 


Val 


Lys 


Thr 


Thr 


Val 


Val 




210 








215 










220 










Tyr 


Gly 


Arg 


Arg 


Arg 


Ser 


Lys 


Ser 


Arg 


Gly 


Arg 


Arg 


Ala 


Pro 


Thr 


Pro 


225 






230 










235 










240 


Gin 


Arg 


Ala 


Gly 


Ser 


Pro 


Leu 


Pro 


Arg 


Ser 


Ser 


Ser 


Ser 


His 


His 


Arg 






245 










250 










255 




Ser 


Phe 


Gly 


Gly 


He 


Leu 


Ala 


Gly 


Leu 


He 


Gly 


Leu 


Leu 


Val 


Ser 


Phe 






260 










265 










270 






Phe 


Leu 


Leu 


He 


Lys 


He 


Leu 


Glu 


lie 


Leu 


Arg 


Arg 


Leu 


Asp 


Trp 


Trp 






275 








280 










285 








Trp 


He 


Ser 


Leu 


Ser 


Ser 


Pro 


Lys 


Gly 


Lys 


Met 


Gin 


Cys 


Ala 


Phe 


Gin 


290 










295 










300 










Asp 


Thr 


Gly 


Ala 


Gin 


lie 


Ser 


Pro 


His 


Tyr 


Val 


Gly 


Ser 


Cys 


Pro 


Trp 


305 








310 










315 










320 


Gly 


Cys 


Pro 


Gly 


Phe 


Leu 


Trp 


Thr 


Tyr 


Leu 


Arg 


Leu 


Phe 


He 


He 


Phe 






325 










330 










335 




Leu 


Leu 


He 


Leu 
340 


Leu 


Val 


Ala 


Ala 


Gly 
345 


Leu 


Leu 


Tyr 


Leu 


Thr 
350 


Asp 


Asn 


Gly 


Ser 


Thr 


He 


Leu 


Gly 


Lys 


Leu 


Gin 


Trp 


Ala 


Ser 


Val 


Ser 


Ala 


Leu 




355 










360 










365 








Phe 


Ser 
370 


Ser 


He 


Ser 


Ser 


Leu 
375 


Leu 


Pro 


Ser 


Asp 


Pro 
380 


Lys 


Ser 


Leu 


Val 


Ala 


Leu 


Thr 


Phe 


Gly 


Leu 


Ser 


Leu 


lie 


Trp 


Met 


Thr 


Ser 


Ser 


Ser 


Ala 


385 








390 










395 










400 
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Thr Gin Thr Leu Val Thr Leu Thr Gin Leu Ala Thr Leu Ser Ala Leu 
405 410 415 

Phe Tyr Lys Ser 
420 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 552 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

ATG GAC ATC GAC CCT TAT AAA GAA TTT GGA GCT ACT GTC GAG TTA CTC 48 
Met Asp lie Asp Pro Tyr Lys Glu Phe Gly Ala Thr Val Glu Leu Leu 
15 10 15 

TCG TTT TTG CCT TCT GAC TTC TTT CCT TCA GTA CGA GAT CTT CTA GAT 96 
Ser Phe Leu Pro Ser Asp Phe Phe Pro Ser Val Arg Asp Leu Leu Asp 
20 25 30 

ACC GCC TCA GCT CTG TAT CGG GAA GCC TTA GAG TCT CCT GAG CAT TGT 144 
Thr Ala Ser Ala Leu Tyr Arg Glu Ala Leu Glu Ser Pro Glu His Cys 
35 40 45 

TCA CCT CAC CAT ACT GCA CTC AGG CAA GCA ATT CTT TGC TGG GGG GAA 192 
Ser Pro His His Thr Ala Leu Arg Gin Ala lie Leu Cys Trp Gly Glu 
50 55 60 

CTA ATG ACT CTA GCT ACC TGG GTG GGT GTT AAT TTG GAA GAT CCA GCG 240 
Leu Met Thr Leu Ala Thr Trp Val Gly Val Asn Leu Glu Asp Pro Ala 
65 70 75 80 

TCT AGA GAC CTA GTA GTC AGT TAT GTC AAC ACT AAT ATG GGC CTA AAG 288 
Ser Arg Asp Leu Val Val Ser Tyr Val Asn Thr Asn Met Gly Leu Lys 
85 90 95 

TTC AGG CAA CTC TTG TGG TTT CAC ATT TCT TGT CTC ACT TTT GGA ACA 336 
Phe Arg Gin Leu Leu Trp Phe His lie Ser Cys Leu Thr Phe Gly Thr 
100 105 110 

GAA ACA GTT ATA GAG TAT TTG GTG TCT TTC GGA GTG TGG ATT CGC ACT 384 
Glu Thr Val lie Glu Tyr Leu Val Ser Phe Gly Val Trp lie Arg Thr 
115 120 125 

CCT CCA GCT TAT AGA CCA CCA AAT GCC CCT ATC CTA TCA ACA CTT CCG 432 
Pro Pro Ala Tyr Arg Pro Pro Asn Ala Pro lie Leu Ser Thr Leu Pro 
130 135 140 

GAG ACT ACT GTT GTT AGA CGA CCA GGC AGG TCC CCT AGA AGA AGA ACT 480 
Glu Thr Thr Val Val Arg Arg Pro Gly Arg Ser Pro Arg Arg Arg Thr 
145 150 155 160 

CCC TCG CCT CGC AGA CGA AGG TCT CAA TCG CCC CGT CGC AGA AGA TCT 528 
Pro Ser Pro Arg Arg Arg Arg Ser Gin Ser Pro Arg Arg Arg Arg Ser 
165 170 175 
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CAA TCT CGG GAA TCT CAA TGT TAG 552 
Gin Ser Arg Glu Ser Gin Cys 
180 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 183 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 



Met Asp 


He 


Asp 


Pro 


Tyr 


Lys 


Glu 


Phe 


Gly 


Ala 


Thr 


Val 


Glu 


Leu 


Leu 


1 








5 










10 










15 




Ser 


Phe 


Leu 


Pro 


Ser 


Asp 


Phe 


Phe 


Pro 


Ser 


Val 


Arg 


Asp 


Leu 


Leu Asp 








20 








25 










30 






Thr 


Ala 


Ser 


Ala 


Leu 


Tyr 


Arg 


Glu 


Ala 


Leu 


Glu 


Ser 


Pro 


Glu 


His 


Cys 






35 








40 










45 








Ser 


Pro 


His 


His 


Thr 


Ala 


Leu 


Arg 


Gin 


Ala 


He 


Leu 


Cys 


Trp 


Gly Glu 




50 










55 










60 










Leu 


Met 


Thr 


Leu 


Ala 


Thr 


Trp 


Val 


Gly 


Val 


Asn 


Leu 


Glu 


Asp 


Pro 


Ala 


65 










70 










75 










80 


Ser 


Arg 


Asp 


Leu 


Val 


Val 


Ser 


Tyr 


Val 


Asn 


Thr 


Asn 


Met 


Gly 


Leu 


Lys 






85 










90 










95 




Phe 


Arg 


Gin 


Leu 


Leu 


Trp 


Phe 


His 


He 


Ser 


Cys 


Leu 


Thr 


Phe 


Gly 


Thr 






100 










105 










110 






Glu 


Thr 


Val 


He 


Glu 


Tyr 


Leu 


Val 


Ser 


Phe 


Gly 


Val 


Trp 


He 


Arg 


Thr 






115 








120 










125 








Pro 


Pro 


Ala 


Tyr 


Arg 


Pro 


Pro 


Asn 


Ala 


Pro 


He 


Leu 


Ser 


Thr 


Leu 


Pro 




130 






135 










140 










Glu 


Thr 


Thr 


Val 


Val 


Arg 


Arg 


Pro Gly 


Arg 


Ser 


Pro 


Arg 


Arg 


Arg 


Thr 


145 










150 










155 










160 


Pro 


Ser 


Pro 


Arg 


Arg 
165 


Arg 


Arg 


Ser 


Gin 


Ser 
170 


Pro 


Arg 


Arg 


Arg 


Arg 
175 


Ser 


Gin 


Ser 


Arg 


Glu 


Ser 


Gin 


Cys 





















180 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 681 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

ATG GAG AAC ATC ACA TCA GGA TTC CTA GGA CCC CTT CTC GTG TTA CAG 48 
Met Glu Asn He Thr Ser Gly Phe Leu Gly Pro Leu Leu Val Leu Gin 
15 10 15 

GCG GGG TTT TTC TTG TTG ACA AGA ATC CTC ACA ATA CCC CAG AGT CTA 96 
Ala Gly Phe Phe Leu Leu Thr Arg He Leu Thr He Pro Gin Ser Leu 
20 25 30 
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GAC TCG TGG TGG ACT TCT CTC AAT TTT CTA GGG GGA ACT ACC GTG TGT 144 
Asp Ser Trp Trp Thr Ser Leu Asn Phe Leu Gly Gly Thr Thr Val Cys 
35 40 45 

CTT GGC CAA AAT TCG CAG TCC CCA ACC TCC AAT CAC TCA CCA ACC TCT 192 
Leu Gly Gin Asn Ser Gin Ser Pro Thr Ser Asn His Ser Pro Thr Ser 
50 55 60 

TGT CCT CCA ACT TGT CCT GGT TAT CGC TGG ATG TGT CTG CGG CGT TTT 240 
Cys Pro Pro Thr Cys Pro Gly Tyr Arg Trp Met Cys Leu Arg Arg Phe 
65 70 75 80 

ATC ATC TTC CTC TTC ATC CTG CTG CTA TGC CTC ATC TTC TTG TTG GTT 288 
He He Phe Leu Phe He Leu Leu Leu Cys Leu He Phe Leu Leu Val 
85 90 95 

CTT CTG GAC TAT CAA GGT ATG TTG CCC GTT TGT CCT CTA ATT CCA GGA 336 
Leu Leu Asp Tyr Gin Gly Met Leu Pro Val Cys Pro Leu He Pro Gly 
100 105 110 

TCC TCA ACA ACC AGC ACG GGA CCA TGC CGG ACC TGC ATG ACT ACT GCT 384 
Ser Ser Thr Thr Ser Thr Gly Pro Cys Arg Thr Cys Met Thr Thr Ala 
115 120 125 

CAA GGA ACC TCT ATG TAT CCC TCC TGT TGC TGT ACC AAA CCT TCG GAC 432 
Gin Gly Thr Ser Met Tyr Pro Ser Cys Cys Cys Thr Lys Pro Ser Asp 
130 135 140 

GGA AAT TGC ACC TGT ATT CCC ATC CCA TCA TCC TGG GCT TTC GGA AAA 480 
Gly Asn Cys Thr Cys He Pro He Pro Ser Ser Trp Ala Phe Gly Lys 
145 150 155 160 

TTC CTA TGG GAG TGG GCC TCA GCC CCT TTC TCC TGG CTC AGT TTA CTA 528 
Phe Leu Trp Glu Trp Ala Ser Ala Pro Phe Ser Trp Leu Ser Leu Leu 
165 170 175 

GTC CCA TTT GTT CAG TGG TTC GTA GGG CTT TCC CCC ACT GTT TGG CTT 576 
Val Pro Phe Val Gin Trp Phe Val Gly Leu Ser Pro Thr Val Trp Leu 
180 185 190 

TCA GTT ATA TGG ATG ATG TGG TAT TGG GGG CCA AGT CTG TAC AGC ATC 624 
Ser Val He Trp Met Met Trp Tyr Trp Gly Pro Ser Leu Tyr Ser He 
195 200 205 

TTG AGT CCC TTT TTA CCG CTG TTA CCA ATT TTC TTT TGT CTT TGG GTA 672 
Leu Ser Pro Phe Leu Pro Leu Leu Pro He Phe Phe Cys Leu Trp Val 
210 215 220 

TAC ATT TAA 681 

Tyr He 

225 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 226 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE 


DESCRIPTION 


Met 


G1U 


Asn 


lie 


inr 


Ser 


viy 




1 








c 
D 








Ala 


Gly 


pne 


pne 


Leu 


Leu 


inr 


Arg 
















Asp 


aer 


Trp 


Trp 


i nr 


Ser 


Leu 


Aon 
noil 


















Leu 


Gly 


Gin 


Asn 


Ser 


Gin 


Ser 


Pro 




50 










bb 




Cys 


Pro 


Pro 


Thr 


Cys 


Pro 


t*iy 


Tyr 


65 










70 






lie 


lie 


Pne 


Leu 


pne 


i ie 


Leu 


Leu 










85 








Leu 


Leu 


Asp 


Tyr 


Gin 


Gly 


Met 


Leu 








100 










Ser 


Ser 


Thr 


Thr 


Ser 


Thr 


Gly 


Pro 






115 










120 


Gin 


Gly 


Thr 


Ser 


Met 


Tyr 


Pro 


Ser 




130 










lob 




Gly 


Asn 


Cys 


Thr 


Cys 


He 


Pro 


Tl - 

lie 


145 










15U 






Phe 


Leu 


Trp 


blU 


Trp 


nld 




nia 










"I £ C 

loo 








Val 


Pro 


Phe 


Val 


Gin 


Trp 


Phe 


Val 








180 










Ser 


Val 


He 


Trp 


Met 


Met 


Trp 


Tyr 






195 










200 


Leu 


Ser 


Pro 


Phe 


Leu 


Pro 


Leu 


Leu 




210 










215 




Tyr 


He 














225 

















SEQ ID NO: 14: 



Leu 


Gly 
10 


Pro 


Leu 


Leu 


Val 


Leu 
15 


Gin 


He 


Leu 


Thr 


He 


Pro 


Gin 


Ser 


Leu 


25 










30 






Phe 


Leu 


Gly 


Gly 


Thr 
45 


Thr 


Val 


Cys 


Thr 


Ser 


Asn 


His 
60 


Ser 


Pro 


Thr 


Ser 


Arg 


Trp 


Met 
75 


Cys 


Leu 


Arg 


Arg 


Phe 
80 


Leu 


Cys 
90 


Leu 


He 


Phe 


Leu 


Leu 
95 


Val 


Pro 


Val 


Cys 


Pro 


Leu 


He 


Pro 


Gly 


105 








110 






Cys 


Arg 


Thr 


Cys 


Met 


Thr 


Thr 


Ala 








125 








Cys 


Cys 


Cys 


Thr 
140 


Lys 


Pro 


Ser 


Asp 


Pro 


Ser 


Ser 
155 


Trp 


Ala 


Phe 


Gly 


Lys 
160 


Pro 


Phe 
170 


Ser 


Trp 


Leu 


Ser 


Leu 
175 


Leu 


Gly 


Leu 


Ser 


Pro 


Thr 


Val 


Trp 


Leu 


185 










190 






Trp 


Gly 


Pro 


Ser 


Leu 


Tyr 


Ser 


He 








205 








Pro 


He 


Phe 


Phe 


Cys 


Leu 


Trp 


Val 



220 
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